Large vocabulary speaker independent isolated word recognition for embedded systems
نویسندگان
چکیده
In this paper the implementation of a word-stem based tree search for large vocabulary speaker independent isolated word recognition for embedded systems is presented. Two fast search algorithms combine the effectiveness of the tree structure for large vocabularies and the fast Viterbi search within the regular structures of word-stems. The algorithms are proved to be very effective for workstation and embedded platform realizations. In order to decrease the processing power the word-stem based tree search with frame dropping approach is used. The recognition speed was increased by a factor of 5 without frame dropping and by a factor of 10 with frame dropping in comparison to linear Viterbi search for isolated word recognition task with a vocabulary of 20102 words. Thus, the large vocabulary isolated word recognition becomes possible for embedded systems.
منابع مشابه
Speaker - Independent Isolated Word Recognition for a Moderate Size ( 54 Word ) Vocabulary
Recent work at Bell Laboratories has shown that statistical clustering techniques could be used to provide a reliable set of reference templates for a speaker-independent isolated-word recognition system. The vocabulary on which the system was tested consisted of the 26 letters of the alphabet, the 10 digits (0 to 9), and 3 command words. Since this vocabulary consisted of a large number of aco...
متن کاملSpeaker independent recognition of isolated words using clustering techniques
A speaker-independent isolated word recognition system is described which is based on the use of multiple templates for each word in the vocabulary. The word templates are obtained from a statistical clustering analysis of a large database consisting of 100 replications of each word (i.e., once by each of 100 bIkers). The recognition system, which accepts telephone quality speech input, is base...
متن کاملAcoustic Modeling of Subword Units for Large Vocabulary Speaker Independent Speech Recognition
The field of large vocabulary, continuous speech recognition has advanced to the point where there are several systems capable of attaining between 90 and 95% word accuracy for speaker independent recognition of a 1000 word vocabulary, spoken fluently for a task with a perplexity (average word branching factor) of about 60. There are several factors which account for the high performance achiev...
متن کاملA modified K-means clustering algorithm for use in isolated work recognition
Studies of isolated word recognition systems have shown that a set of carefully chosen templates can be used to bring the performance of speaker-independent systems up to that of systems trained to the individual speaker. The earliest work in this area used a sophisticated set of pattern recognition algorithms in a human-interactive mode to create the set of templates (multiple patterns) for ea...
متن کاملInteractive Clustering Techniques for Selecting Speaker-Independent Reference Templates for Isolated Word Recognition
It is demonstrated that clustering can be a powerful tool for selecting reference templates for speaker-independent word recognition. We describe a set of clustering techniques specifically designed for this purpose. These interactive procedures identify coarse structure, fine structure, overlap of, and outliers from clusters. The techniques have been applied t a large speech data base consisti...
متن کامل